Add OpenTelemetry instrumentation #59

andmat900 · 2024-04-10T12:39:11Z

Applicable Issues

Add tracing with OpenTelemetry etos#228
Depends On Add tracing etos-library#27

Description of the Change

This change adds OpenTelemetry instrumentation to etos-suite-runner.

Alternate Designs

Possible Drawbacks

Sign-off

Developer's Certificate of Origin 1.1

By making a contribution to this project, I certify that:

(a) The contribution was created in whole or in part by me and I
have the right to submit it under the open source license
indicated in the file; or

(b) The contribution is based upon previous work that, to the best
of my knowledge, is covered under an appropriate open source
license and I have the right under that license to submit that
work with modifications, whether created in whole or in part
by me, under the same open source license (unless I am
permitted to submit under a different license), as indicated
in the file; or

(c) The contribution was provided directly to me by some other
person who certified (a), (b) or (c) and I have not modified
it.

(d) I understand and agree that this project and the contribution
are public and that a record of the contribution (including all
personal information I submit with it, including my sign-off) is
maintained indefinitely and may be redistributed consistent with
this project or the open source license(s) involved.

Signed-off-by: Andrei Matveyeu, [email protected]

projects/etos_suite_runner/src/etos_suite_runner/esr.py

t-persson · 2024-04-22T10:54:26Z

projects/etos_suite_runner/src/etos_suite_runner/lib/executor.py

+        span_name = "start_execution_space"
+        with self.tracer.start_as_current_span(span_name) as span:
+            span.set_attribute("executor_id", executor["id"])
+            span.set_attribute("request", dumps(request, indent=4))


Is indent=4 necessary? It increases the size of the request sent to the OpenTelemetry collector.

The problem is that without it the request isn't readable, at least in Jaeger, where it is shown as a plain text blob.

The indent argument is now removed.
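For reference, a minimal sketch of the size trade-off discussed in this thread, using only the standard library. The request payload here is hypothetical (the real request structure is not shown in the diff), so the byte counts only illustrate the trend.

```python
import json

# Hypothetical payload for illustration; not the actual executor request.
request = {
    "executor_id": "1234",
    "environment": {"SUITE": "example", "RETRIES": 3},
    "tests": [{"name": f"test_{i}", "timeout": 60} for i in range(5)],
}

# Pretty-printed form: readable as a text blob, but padded with whitespace.
pretty = json.dumps(request, indent=4)
# Compact form: no whitespace between separators.
compact = json.dumps(request, separators=(",", ":"))

pretty_size = len(pretty.encode("utf-8"))
compact_size = len(compact.encode("utf-8"))

print(f"indent=4: {pretty_size} bytes, compact: {compact_size} bytes")
```

Both strings decode to the same object, so the indentation only affects how the attribute is displayed, not what it contains.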

t-persson · 2024-04-22T10:55:58Z

projects/etos_suite_runner/src/etos_suite_runner/lib/suite.py

+        # OpenTelemetry context needs to be retrieved here:
+        # the subsuite is running in a separate process


Is this comment still valid?

The comment is still relevant; I have clarified it in the latest update.

projects/etos_suite_runner/src/etos_suite_runner/lib/suite.py

t-persson · 2024-04-22T11:08:31Z

projects/etos_suite_runner/src/etos_suite_runner/otel_tracing.py

+def get_current_context() -> opentelemetry.context.context.Context:
+    """Get current context (propagated via environment variable OTEL_CONTEXT)."""
+    carrier = {}
+    LOGGER.info("Current OpenTelemetry context env: %s", os.environ.get("OTEL_CONTEXT"))
+    for kv in os.environ.get("OTEL_CONTEXT", "").split(","):
+        if kv:
+            k, v = kv.split("=", 1)
+            carrier[k] = v
+    ctx = opentelemetry.propagate.extract(carrier)
+    LOGGER.info("Current OpenTelemetry context %s", ctx)
+    return ctx


We do something similar to this in the ETOS library where we utilize the built-in textmap propagator.
Called here: https://github.com/eiffel-community/etos-library/blob/main/src/etos_lib/eiffel/subscriber.py#L104-L106
Extracted here: https://github.com/eiffel-community/etos-library/blob/main/src/etos_lib/eiffel/subscriber.py#L37-L52

I believe that the one in the ETOS library is simpler.

In the latest update I have made it more idiomatic. It does not make the code simpler, but it may look better.
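As an illustration of the parsing step in the hunk above, here is a standalone sketch that uses only the standard library. The helper names `parse_otel_context` and `format_otel_context` are hypothetical and are not part of this PR or of the ETOS library; the resulting dict is the kind of carrier that the diff passes to `opentelemetry.propagate.extract`.

```python
def parse_otel_context(value: str) -> dict:
    """Parse a 'k1=v1,k2=v2' string into a carrier dict.

    Entries without '=' (including empty ones) are skipped.
    """
    return dict(kv.split("=", 1) for kv in value.split(",") if "=" in kv)


def format_otel_context(carrier: dict) -> str:
    """Inverse of parse_otel_context: serialize a carrier dict."""
    return ",".join(f"{key}={value}" for key, value in carrier.items())


# Example value in W3C traceparent format (version-traceid-spanid-flags).
env_value = "traceparent=00-0af7651916cd43dd8448eb211c80319c-b7ad6b7169203331-01"

carrier = parse_otel_context(env_value)
print(carrier)

# An unset or empty variable yields an empty carrier.
empty_carrier = parse_otel_context("")
```

One caveat of this comma-separated format, assuming it is used as shown: a `tracestate` header with several comma-separated members would not survive the round trip, since its value itself contains commas.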

Dockerfile

fredjn · 2024-04-22T12:47:13Z

projects/etos_suite_runner/src/etos_suite_runner/lib/suite.py

+            timeout = time.time() + self.etos.debug.default_test_result_timeout
+            try:
+                while time.time() < timeout:
+                    time.sleep(10)


Isn't a mandatory 10-second sleep a bit suboptimal? Couldn't we just use a shorter sleep inside the 'not self.started' loop?

I'd avoid changing it here. This is old code; I only had to change the indentation because of the OpenTelemetry span recording.

That sleep is there so that we do not spam the event repository too much. It is needed both for the 'if not self.started' check and, further down, for the 'if self.finished is False' check.
This loop waits for the test runner to start and then to finish.
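The pattern discussed here (poll until a deadline, with a fixed pause between polls to limit load on the backend) can be sketched in isolation as follows. This is not the suite runner's actual loop: `wait_until` and `FakeClock` are hypothetical names, and unlike the code in the diff this sketch checks the condition before the first sleep. The clock and sleep functions are injectable so that the example runs instantly.

```python
import time


def wait_until(condition, timeout, interval=10.0, clock=time.time, sleep=time.sleep):
    """Poll condition() until it returns True or the timeout expires.

    The pause between polls limits how often the backend is queried.
    Returns True if the condition was met, False on timeout.
    """
    deadline = clock() + timeout
    while clock() < deadline:
        if condition():
            return True
        sleep(interval)
    return False


class FakeClock:
    """Deterministic stand-in for time.time/time.sleep."""

    def __init__(self):
        self.now = 0.0

    def time(self):
        return self.now

    def sleep(self, seconds):
        self.now += seconds


fake = FakeClock()
polls = []


def started():
    # Record when each poll happens; report "started" after 30 seconds.
    polls.append(fake.now)
    return fake.now >= 30


result = wait_until(started, timeout=60, interval=10, clock=fake.time, sleep=fake.sleep)
print(result, polls)  # True [0.0, 10.0, 20.0, 30.0]
```

With a 10-second interval the condition is evaluated four times in the first 30 seconds; a shorter interval would react faster at the cost of more queries.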

andmat900 requested a review from a team as a code owner April 10, 2024 12:39

andmat900 requested review from t-persson and fredjn and removed request for a team April 10, 2024 12:39

andmat900 force-pushed the 20240402_opentelemetry branch from c12ae03 to e187c17 Compare April 19, 2024 12:29

t-persson requested changes Apr 22, 2024

fredjn reviewed Apr 22, 2024

andmat900 force-pushed the 20240402_opentelemetry branch 4 times, most recently from f9fd9aa to 84f150b Compare April 26, 2024 10:44

andmat900 force-pushed the 20240402_opentelemetry branch from 84f150b to 11b187c Compare May 3, 2024 05:36

andmat900 added 9 commits May 3, 2024 07:37

Add OpenTelemetry instrumentation

32247e3

Minor fixes

2390b2d

Minor fixes

6bc6cf5

Minor fix

5d37536

Minor fixes

6282ae4

Minor fixes

8f4bcfc

Review updates

0fc4407

Code review changes

aed50b7

Code review changes

ebacc99

andmat900 force-pushed the 20240402_opentelemetry branch from 11b187c to ebacc99 Compare May 3, 2024 05:41

Code review changes

68a44a2

andmat900 requested a review from fredjn May 3, 2024 09:27

fredjn approved these changes May 6, 2024

andmat900 merged commit 9cd62cc into eiffel-community:main May 7, 2024
3 checks passed

andmat900 commented Apr 10, 2024

t-persson Apr 22, 2024

andmat900 Apr 26, 2024

andmat900 May 3, 2024

t-persson Apr 22, 2024

andmat900 Apr 25, 2024

andmat900 Apr 26, 2024

t-persson Apr 22, 2024

andmat900 Apr 26, 2024

fredjn Apr 22, 2024

andmat900 Apr 25, 2024

t-persson Apr 26, 2024
